AITopics | sampling bias

sampling bias

Robust Correction of Sampling Bias using Cumulative Distribution Functions

Neural Information Processing Systems, Dec-23-2025, 20:58:58 GMT

Varying domains and biased datasets can lead to differences between the training and the target distributions, known as covariate shift. Current approaches for alleviating this often rely on estimating the ratio of the training and target probability density functions. These techniques require parameter tuning and can be unstable across different datasets. We present a new method for handling covariate shift that uses empirical cumulative distribution function estimates of the target distribution. It rigorously generalizes a recent idea proposed by Vapnik and Izmailov. Further, we show experimentally that our method is more robust in its predictions, does not rely on parameter tuning, and shows classification performance similar to current state-of-the-art techniques on synthetic and real datasets.
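
As a rough illustration of the two ingredients named in this abstract (and not the authors' method), the sketch below builds an empirical CDF from a target sample and computes histogram-based density-ratio weights of the kind the abstract attributes to current approaches. The function names, the `bin_width` parameter, and the toy data are all assumptions made for this example.

```python
import bisect
from collections import Counter


def empirical_cdf(sample):
    """Return a function F where F(x) is the fraction of sample values <= x."""
    ordered = sorted(sample)
    n = len(ordered)

    def cdf(x):
        return bisect.bisect_right(ordered, x) / n

    return cdf


def density_ratio_weights(train, target, bin_width=1.0):
    """Histogram estimate of p_target(x) / p_train(x) for each training point."""
    def bin_of(x):
        return int(x // bin_width)

    train_counts = Counter(bin_of(x) for x in train)
    target_counts = Counter(bin_of(x) for x in target)
    weights = []
    for x in train:
        b = bin_of(x)
        p_train = train_counts[b] / len(train)
        p_target = target_counts[b] / len(target)  # 0 if the bin is unseen in target
        weights.append(p_target / p_train)
    return weights


def weighted_mean(values, weights):
    """Self-normalized weighted average."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Toy data (hypothetical): training over-samples 0, the target over-samples 1.
train = [0, 0, 0, 1]
target = [0, 1, 1, 1]

weights = density_ratio_weights(train, target)
target_cdf = empirical_cdf(target)
reweighted = weighted_mean(train, weights)
```

On this toy data the unweighted training mean is 0.25, while the reweighted mean recovers the target mean of 0.75. The `bin_width` choice is one example of the kind of tuning parameter the abstract describes as a weakness of density-ratio approaches.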

cumulative distribution function, robust correction, sampling bias, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.42)

On the Origins of Sampling Bias: Implications on Fairness Measurement and Mitigation

Zhioua, Sami, Binkyte, Ruta, Ouni, Ayoub, Ktata, Farah Barika

arXiv.org Artificial Intelligence, Mar-23-2025

Accurately measuring discrimination is crucial to faithfully assessing the fairness of trained machine learning (ML) models. Any bias in measuring discrimination leads to either amplification or underestimation of the existing disparity. Several sources of bias exist, and it is assumed that bias resulting from machine learning is borne equally by different groups (e.g. females vs males, whites vs blacks). If, however, bias is borne differently by different groups, it may exacerbate discrimination against specific sub-populations. Sampling bias, in particular, is used inconsistently in the literature to describe bias due to the sampling procedure. In this paper, we attempt to disambiguate this term by introducing clearly defined variants of sampling bias, namely sample size bias (SSB) and underrepresentation bias (URB). Through an extensive set of experiments on benchmark datasets and using mainstream learning algorithms, we expose relevant observations in several model training scenarios. The observations are finally framed as actionable recommendations for practitioners.
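
One way to picture the distinction between the two variants is the sketch below. It is an illustration under stated assumptions, not the paper's protocol: the abstract does not say which discrimination metric is used, so statistical parity difference is assumed here, and `shrink_sample`, `underrepresent`, the group labels, and the toy population are hypothetical.

```python
import random


def statistical_parity_difference(records, privileged, unprivileged):
    """Positive-outcome rate of the privileged group minus that of the other group.

    Each record is a (group, outcome) pair with outcome in {0, 1}.
    """
    def positive_rate(group):
        outcomes = [y for g, y in records if g == group]
        if not outcomes:
            raise ValueError(f"no records for group {group!r}")
        return sum(outcomes) / len(outcomes)

    return positive_rate(privileged) - positive_rate(unprivileged)


def shrink_sample(records, size, seed=0):
    """Simulate a small sample: draw `size` records uniformly from all groups."""
    return random.Random(seed).sample(records, size)


def underrepresent(records, group, keep_fraction, seed=0):
    """Simulate underrepresentation: keep only a fraction of one group's records."""
    members = [r for r in records if r[0] == group]
    others = [r for r in records if r[0] != group]
    keep = max(1, round(len(members) * keep_fraction))
    return others + random.Random(seed).sample(members, keep)


# Hypothetical population: group "a" has a 60% positive rate, group "b" 30%.
population = (
    [("a", 1)] * 60 + [("a", 0)] * 40 + [("b", 1)] * 30 + [("b", 0)] * 70
)

full_gap = statistical_parity_difference(population, "a", "b")
small = shrink_sample(population, 20)
skewed = underrepresent(population, "b", keep_fraction=0.1)
skewed_gap = statistical_parity_difference(skewed, "a", "b")
```

Comparing `full_gap` with the gap measured on `small` or `skewed` across many seeds would show how far each kind of sampling moves the measured disparity from its population value.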

artificial intelligence, log scale, machine learning, (16 more...)

arXiv:2503.17956

Country:

North America > United States (0.14)
Africa > Middle East > Tunisia > Sousse Governorate > Sousse (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)
(3 more...)

Genre: Research Report > New Finding (0.95)

Industry:

Information Technology > Security & Privacy (0.45)
Law (0.34)
Education (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Robust Correction of Sampling Bias using Cumulative Distribution Functions

Neural Information Processing Systems, Oct-9-2024, 18:56:47 GMT

cumulative distribution function, robust correction, sampling bias, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Effect Inference from Two-Group Data with Sampling Bias

Zachariah, Dave, Stoica, Petre

arXiv.org Machine Learning, Feb-26-2019

In many applications, different populations are compared using data that are sampled in a biased manner. Under sampling bias, standard methods that estimate the difference between the population means yield unreliable inferences. Here we develop an inference method that is resilient to sampling bias and, in contrast to the standard approach, is able to control false positive errors under moderate bias levels. We demonstrate the method using synthetic and real biomarker data.
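
To make the failure mode concrete, the sketch below implements only the standard approach the abstract criticizes (a difference in sample means with a Welch-style statistic), not the authors' resilient method. The function name and the toy data are assumptions chosen for illustration.

```python
import math
import statistics


def mean_difference(a, b):
    """Return (difference in sample means, Welch-style t statistic)."""
    diff = statistics.mean(a) - statistics.mean(b)
    std_err = math.sqrt(
        statistics.variance(a) / len(a) + statistics.variance(b) / len(b)
    )
    return diff, diff / std_err


# Hypothetical setup: both groups come from the same population,
# so the true difference between population means is zero.
population = list(range(10))

# Unbiased comparison: both groups observed in full.
diff_full, _ = mean_difference(population, population)

# Biased sampling: the second group is observed only for values >= 5.
biased_group = [x for x in population if x >= 5]
diff_biased, t_biased = mean_difference(population, biased_group)
```

Here `diff_full` is 0 while `diff_biased` is -2.5, so the standard estimate reports an effect that exists only because of how the second group was sampled.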

artificial intelligence, estimator, machine learning, (16 more...)

arXiv:1902.09923

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)

Genre: Research Report (0.83)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.49)
Health & Medicine > Therapeutic Area (0.30)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.37)
